๐Ÿฟ๏ธ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
๐Ÿ”ค Morphological Analysis

Natural Language Processing, Historical Linguistics, Word Structure, Computational Morphology

Mind the Gap: Assessing Wiktionary's Crowd-Sourced Linguistic Knowledge on Morphological Gaps in Two Related Languages
arxiv.orgยท1d
๐Ÿค–Automated Parsing
davidchisnall/igk: I got Knuth'd: A compiler for documents
github.comยท8h
๐Ÿ“Concrete Syntax
Stop Words Using Spacy - NLP
dev.toยท1dยท
Discuss: DEV
๐Ÿ“Text Parsing
Linguistics vs. archeology and (physical) anthropology
languagelog.ldc.upenn.eduยท14h
๐Ÿ“กFrequency Archaeology
The more LLMs think, the worse they translate
nuenki.appยท3hยท
Discuss: Hacker News
โš™๏ธCompression Benchmarking
Cactus Language โ€ข Syntax 11
inquiryintoinquiry.comยท1d
๐Ÿ“Concrete Syntax
Kumo Surfaces Structured Data Patterns Generative AI Misses
thenewstack.ioยท53m
๐Ÿ“ŠGraph Databases
Text2Struct: A Machine Learning Pipeline for Mining Structured Data from Text
arxiv.orgยท1d
๐Ÿ”คCharacter Classification
Why Your Next LLM Might Not Have A Tokenizer
towardsdatascience.comยท19h
๐Ÿค–Grammar Induction
Lemmatization as a Classification Task: Results from Arabic across Multiple Genres
arxiv.orgยท1d
๐Ÿค–Grammar Induction
Computational Approaches to Understanding Large Language Model Impact on Writing and Information Ecosystems
arxiv.orgยท1d
๐Ÿ“œDigital Philology
DRIFT: Data Reduction via Informative Feature Transformation- Generalization Begins Before Deep Learning starts
arxiv.orgยท10h
๐Ÿง Machine Learning
Accumulation of Cognitive Debt When Using an AI Assistant for Essay Writing Task
media.mit.eduยท1dยท
Discuss: Hacker News
๐Ÿง Intelligence Compression
Machine Learning Fundamentals: active learning
dev.toยท22hยท
Discuss: DEV
๐Ÿค–Grammar Induction
Unveiling Factors for Enhanced POS Tagging: A Study of Low-Resource Medieval Romance Languages
arxiv.orgยท1d
๐Ÿ‘๏ธMedieval OCR
PATCH! {P}sychometrics-{A}ssis{T}ed Ben{CH}marking of Large Language Models against Human Populations: A Case Study of Proficiency in 8th Grade Mathematics
arxiv.orgยท10h
๐Ÿง Intelligence Compression
QuranMorph: Morphologically Annotated Quranic Corpus
arxiv.orgยท1d
๐Ÿ“‹Document Grammar
Multilingual Tokenization through the Lens of Indian Languages: Challenges and Insights
arxiv.orgยท1d
๐Ÿ“Text Parsing
Markov-Enhanced Clustering for Long Document Summarization: Tackling the 'Lost in the Middle' Challenge with Large Language Models
arxiv.orgยท1d
๐Ÿ“„Text Chunking
The Internal Inconsistency of Large Language Models
blog.kortlepel.comยท22hยท
Discuss: Hacker News
๐Ÿ’ปLocal LLMs
Loading...Loading more...
AboutBlogChangelogRoadmap